Integrating Data Warehouses with Web Data for Olap Using Semantic Data Clustering Techniques

نویسنده

  • C. Chandrasekar
چکیده

Nowadays, Information retrieval plays an important role in the web. Many researches presented techniques for information retrieval process from databases. The previous work presented extended tree pattern clustering process for XML massive storages. This paper presents a new technique termed semantic data clustering (SDC) technique for combining the Data warehouse and web data for OLAP by retrieving the semantic data from DW (Data Warehouse). Usually, XML technologies are used to store, retrieve, integrate and combine the web data and the applications in Data warehouse. Using semantic data clustering technique, the semantic data repositories are retrieved from DW, which is the devise of multidimensional databases for XML data sources, and the XML additions of OnLine Analytical Processing (OLAP) techniques. SDC will efficiently tackle the information retrieval process in a DW to utilize textrich document collections. For the XML data sources, the SDC will build the tree pattern for a clustered XML schema to retrieve the massive storage of data for OLAP. SDC uses clustering technique for building tree-pattern framework in order to use massive XML databases to data warehouse for OLAP. We also show the advantages of using semantic data clustering for building the treepattern in handling large amounts of XML documents for OLAP in data warehouse. A reliable performance improvement is achieved with the proposed SDC in XML database to data warehouse, compared to an existing ETC technique for XML storages, in terms of building time, query execution time for deriving the semantic data from DW, effectiveness of clustering process.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Framework for a Multidimensional OLAP Model using Topic Maps

The goal of a data warehouse is to integrate applications at the data level. Data warehouse architectures are developing in response to our increasing data and information requirements. The traditional notion of data warehouses is evolving into a federated warehouse augmented by a set of processes and services to support integrated and consistent access to heterogeneous, decentralized warehouse...

متن کامل

A Foundation for Spatial Data Warehouses on the Semantic Web

Large volumes of geospatial data are being published on the Semantic Web (SW), yielding a need for advanced analysis of such data. However, existing SW technologies only support advanced analytical concepts such as multidimensional (MD) data warehouses and Online Analytical Processing (OLAP) over non-spatial SW data. To remedy this need, this paper presents the QB4SOLAP vocabulary, which suppor...

متن کامل

Modeling and Querying Data Cubes on the Semantic Web

The web is changing the way in which data warehouses are designed, used, and queried. With the advent of initiatives such as Open Data and Open Government, organizations want to share their multidimensional data cubes and make them available to be queried online. The RDF data cube vocabulary (QB), the W3C standard to publish statistical data in RDF, presents several limitations to fully support...

متن کامل

Modeling and Querying Spatial Data Warehouses on the Semantic Web

The Semantic Web (SW) has drawn the attention of data enthusiasts, and also inspired the exploitation and design of multidimensional data warehouses, in an unconventional way. Traditional data warehouses (DW) operate over static data. However multidimensional (MD) data modeling approach can be dynamically extended by defining both the schema and instances of MD data as RDF graphs. The importanc...

متن کامل

QB4OLAP: A Vocabulary for OLAP Cubes on the Semantic Web

On-Line Analytical Processing (OLAP) tools allow querying large multidimensional (MD) databases called data warehouses (DW). OLAP-style data analysis over the semantic web (SW) is gaining momentum, and thus SW technologies will be needed to model, manipulate, and share MD data. To achieve this, the definition of a vocabulary that adequately represents OLAP data is required. Unfortunately, so fa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012